Communication method, apparatus, electronic device, and storage medium

ABSTRACT

Communication method, apparatus, electronic device and storage medium are provided. The communication method is applied to a video server having an IP network connection with a telephone network node, and includes: receiving a first video call including an identification of a first dialing device from the telephone network node based on the IP network connection; determining a first video channel with the first dialing device and second video channels with a first IP terminal device and a second IP terminal device, wherein the first dialing device and the telephone network node are located in a telephone network, and the first IP terminal device, the second IP terminal device and the video server are located in an IP network; and performing channel combination on the first video channel and the second video channels to establish a multi-party video call between the first dialing device and multiple IP terminal devices.

CROSS REFERENCE TO RELATED PATENT APPLICATIONS

This application claims priority to Chinese Patent Application No.202110518868.0, filed on 12 May 2021 and entitled “Communication Method,Apparatus, Electronic Device, and Storage Medium,” which is herebyincorporated by reference in its entirety.

TECHNICAL FIELD

The present disclosure relates to the technical field of computers, andin particular, to communication methods, apparatuses, electronicdevices, and storage media.

BACKGROUND

Mobile communication technology is one of the important achievements ofcomputer networks. Mobile communication technology has undergone thedevelopment of the first, second, third and fourth generationtechnologies. At present, the fifth generation mobile communicationtechnology is the latest generation of cellular mobile communicationtechnology, and is also an extension after 4G (LTE-A, WiMax), 3G (UMTS,LTE) and 2G (GSM) systems. The performance goals of 5G are high datarates, reduced latency, energy savings, lower costs, increased systemcapacity, and large-scale device connectivity.

The fifth generation mobile communication technology can support videocall functions between mobile phones through a cellular network, but thecommunication cost still has room for improvement.

SUMMARY

This Summary is provided to introduce a selection of concepts in asimplified form that are further described below in the DetailedDescription. This Summary is not intended to identify all key featuresor essential features of the claimed subject matter, nor is it intendedto be used alone as an aid in determining the scope of the claimedsubject matter. The term “techniques,” for instance, may refer todevice(s), system(s), method(s) and/or processor-readable/computerreadable instructions as permitted by the context above and throughoutthe present disclosure.

In view of this, embodiments of the present disclosure providecommunication method, apparatus, electronic device, and storage mediumto solve or alleviate the above-mentioned problems.

According to the present disclosure, a communication method is provided,which is applied to a video server having an IP network connection witha telephone network node. The method includes: receiving a first videocall sent from the telephone network node based on the IP networkconnection, the video call including an identification of a firstdialing device; determining a first video channel with the first dialingdevice and second video channels with a first IP terminal device and asecond IP terminal device in response to the first video call, whereinthe first dialing device and the telephone network node are located in atelephone network, and the first IP terminal device, the second IPterminal device and the video server are located in an IP network; andperforming channel combination on the first video channel and the secondvideo channels to establish a multi-party video call between the firstdialing device and multiple IP terminal devices.

According to the present disclosure, a communication method is provided,which is applied to a video server having an IP network connection witha telephone network node. The method includes: obtaining a video callsent by an IP terminal device, the IP terminal device and the videoserver being located in an IP network; determining first media channelswith a first dialing device and a second dialing device, and a secondmedia channel with the IP terminal device in response to the video call,wherein the first dialing device and the second dialing device arelocated in a telephone network; and performing channel combination onthe first media channels and the second media channel to establish amulti-party video call among the IP terminal device, the first dialingdevice, and the second dialing device.

According to the present disclosure, a communication method is provided,which is applied to a telephone network node having an IP networkconnection with a video server. The method includes: forwarding areceived video call to the video server based on the IP networkconnection, wherein a dialing device that initiates the video call andthe telephone network node are located in a telephone network; receivinga session request sent by the video server based on the IP networkconnection in response to the video call, and forwarding the sessionrequest to the dialing device; sending a session confirmation receivedfrom the dialing device to the video server based on the IP networkconnection to establish a video channel between the dialing device andthe video server; and forwarding video call data between the dialingdevice and the video server based on the video channel.

According to the present disclosure, a communication method is provided,which is applied to a dialing device, and includes: sending a video callto a telephone network node; receiving a session request sent by thetelephone network node in response to the video call request, thesession request including an identification of the video server;establishing a video channel with the video server in response to thesession request; and sending video call data to the telephone networknode or receiving video call data from the telephone network node basedon the video channel.

According to the present disclosure, a communication apparatus isprovided, which is applied to a video server having an IP networkconnection with a telephone network node. The apparatus includes: areceiving module that receives a first video call sent by the telephonenetwork node based on the IP network connection, the video callincluding an identification of a first dialing device; a determinationmodule that determines a first video channel with the first dialingdevice and second video channels with a first IP terminal device and asecond IP terminal device in response to the first video call, whereinthe first dialing device and the telephone network node are located in atelephone network, the first IP terminal device, the second IP terminaldevice, and the video server are located in an IP network; and acombination module performs channel combination on the first videochannel and the second video channels to establish a multi-party videocall between the first dialing device and multiple IP terminal devices.

According to the present disclosure, a communication apparatus isprovided, which is applied to a video server having an IP networkconnection with a telephone network node. The apparatus includes: anacquisition module that obtains a video call sent by an IP terminaldevice, the IP terminal device and the video server being located in anIP network; a determination module that determines first media channelswith the first dialing device and the second dialing device and a secondmedia channel with the IP terminal device in response to the video call,wherein the first dialing device and the second dialing device arelocated in a telephone network; and a combination module that performschannel combination on the first media channels and the second mediachannel to establish a multi-party video call among the IP terminaldevice, the first dialing device and the second dialing device.

According to the present disclosure, a communication apparatus isprovided, which is applied to a telephone network node having an IPnetwork connection with a video server. The apparatus includes: a firsttransceiver module that forwards a received video call to the videoserver based on the IP network connection, wherein a dialing device thatinitiates the video call and the telephone network node are located in atelephone network; a second transceiver module that receives a sessionrequest sent by the video server in response to the video call based onthe IP network connection, and forwards the session request to thedialing device; a third transceiver module that sends a sessionconfirmation received from the dialing device to the video server basedon the IP network connection to establish a video channel between thedialing device and the video server; and a fourth transceiver module,that forwards video call data between the dialing device and the videoserver based on the video channel.

According to the present disclosure, a communication apparatus, which isapplied to a dialing device, includes: a first transceiver module thatsends a video call to a telephone network node; a second transceivermodule that receives a session request sent by the telephone networknode in response to the video call, the session request including anidentification of a video server; an establishing module thatestablishes a video channel with the video server in response to thesession request; and a third transceiver module that sends video calldata to the telephone network node or receives video call data from thetelephone network node based on the video channel.

According to the present disclosure, a communication device is provided.The device includes: a processor, a memory, a communication interface,and a communication bus. The processor, the memory and the communicationinterface complete mutual communication through the communication bus.The memory is used for storing at least one executable instruction. Theexecutable instruction enables the processor to perform operationscorresponding to the method according to any one of the first aspect tothe fourth aspect.

According to the present disclosure, a storage medium storing a computerprogram is provided. When the program is executed by a processor, themethod according to any one of the first to fourth aspects isimplemented.

In the present disclosure, since a video server having an IP networkconnection with a telephone network node realizes a video channel with adialing device located in the same telephone network of the telephonenetwork node, the transmission of video data between the IP network andthe telephone network is realized. In addition, since respective videochannels of multiple IP terminal devices can be configured in the IPnetwork, a multi-party video call between multiple IP terminal devicesand the dialing device is realized, which improves the performance ofthe video service capability of the IP network and the telephonenetwork, and saves the communication cost.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to describe the technical solutions of the embodiments of thepresent disclosure more clearly, accompanying drawings that need to beused for describing the embodiments are briefly described as follows.Apparently, the drawings in the following description represent onlysome examples described in the embodiments of the present disclosure.For one of ordinary skill in the art, other drawings can also beobtained according to these drawings.

FIG. 1 is a schematic flowchart of a communication method according toan embodiment of the present disclosure.

FIG. 2A is a schematic interaction diagram of a communication methodaccording to another embodiment of the present disclosure.

FIG. 2B is a schematic interaction diagram of a communication methodaccording to another embodiment of the present disclosure.

FIG. 3 is a schematic flowchart of a communication method according toanother embodiment of the present disclosure.

FIG. 4 is a schematic flowchart of a communication method according toanother embodiment of the present disclosure.

FIG. 5 is a schematic flowchart of a communication method according toanother embodiment of the present disclosure.

FIG. 6 is a schematic block diagram of a communication apparatusaccording to another embodiment of the present disclosure.

FIG. 7 is a schematic block diagram of a communication apparatusaccording to another embodiment of the present disclosure.

FIG. 8 is a schematic block diagram of a communication apparatusaccording to another embodiment of the present disclosure.

FIG. 9 is a schematic block diagram of a communication apparatusaccording to another embodiment of the present disclosure.

FIG. 10 is a hardware structure of an electronic device according toanother embodiment of the present disclosure.

DETAILED DESCRIPTION

In order to make one skilled in the art better understand the technicalsolutions in the embodiments of the present disclosure, the technicalsolutions in the embodiments of the present disclosure will be clearlyand completely described below with reference to the accompanyingdrawings in the embodiments of the present disclosure. The describedembodiments represent only some but not all of the embodiments of thepresent disclosure. All other embodiments obtained by one of ordinaryskill in the art based on the examples in the embodiments of the presentdisclosure shall fall within the scope of protection of the embodimentsof the present disclosure.

Specific implementations of the embodiments of the present disclosureare further described below with reference to the accompanying drawingsof the embodiments of the present disclosure.

FIG. 1 is a schematic flowchart of a communication method according toan embodiment of the present disclosure. The communication method ofFIG. 1 is applied to a video server having an IP network connection witha telephone network node. It should be understood that the video serverincludes, but is not limited to, a server with video processingcapability, a server cluster, a cloud server such as a dedicated cloud,a private cloud, and a public cloud, etc.

The method includes the following steps:

110: Receive a first video call sent by the telephone network node basedon the IP network connection, the video call including an identificationof a first dialing device.

It should be understood that the so-called telephone network of thetelephone network node includes, but is not limited to, a publicswitched telephone network (PSTN), a public land mobile network (PLMNfor short), etc. The telephone network node may include a gatewayconnecting the telephone network to the IP network. The InternetProtocol (IP) network is an Internet layer protocol in TCP/IP systems.Using an IP network can improve the scalability of the network. Forexample, the interconnection of large-scale and heterogeneous networksis realized, and the coupling relationship between top-level networkapplications and bottom-level network technologies is divided tofacilitate independent developments of the two (top-level networkapplications and bottom-level network technologies). In other words, thetelephone network may be a non-IP network.

It also should be understood that the telephone network node may beconfigured with an IP Multimedia Subsystem (IMS) to help establish amedia channel that is capable of conducting video communication with theIP network. Messages (e.g., video calls, etc.) received by the telephonenetwork node from a dialing device are in compliance with thecommunication protocol (e.g., statutory standard protocol or de factostandard protocol) of the telephone operator deploying theabove-mentioned telephone network. Telephone operators provide IPnetwork traffic services and IP video telephone services over thetelephone network, wherein initiating a video call corresponds to an IPvideo telephone service.

It also should be understood that the dialing device may include amobile phone, a landline phone, any electronic device configured withlandline phone functions, and the like.

It also should be understood that receiving the first video call sent bythe telephone network node based on the IP network connection may bereceiving the first video call initiated by the first dialing device andforwarded by the telephone network node based on the IP networkconnection. In other words, the first dialing device sends the firstvideo call to the telephone network node, and the telephone network nodethen forwards the first video call to the video server. When performingthe above forwarding processing, the telephone network node can use anIMS system to change the communication protocol of the first video callfrom the communication protocol of the telephone network to the IPcommunication protocol, and send the video call based on the IP network.

120: Determine a first video channel with the first dialing device, andsecond video channels with a first IP terminal device and a second IPterminal device in response to the first video call, wherein the firstdialing device and the telephone network node are located in thetelephone network, and the first IP terminal device, the second IPterminal device and the video server are located in the IP network.

It should be understood that the video server, the first IP terminaldevice and the second IP terminal device are devices belonging to thesame service architecture. For example, the video server can be used asa video processing device dedicated for the first IP terminal device andthe second IP terminal device. In addition, the video server can alsoserve as a third-party video server other than the first IP terminaldevice and the second IP terminal device. Before the dialing deviceinitiates a video call, at least one of the first IP terminal device andthe second IP terminal device may have a connection with the videoserver based on the second video channel(s).

It also should be understood that determining the first video channeland the second video channels in response to the first video call mayinclude the following implementations. In a first example, the firstvideo channel may be established in response to the first video call,and the second video channels are determined, wherein the second videochannels were established prior to receiving the first video call. In asecond example, the first video channel and the second video channelsmay be established in response to the first video call.

It also should be understood that establishing at least one of the firstvideo channel and the second video channels may be performed by adoptinga Session Initiation Protocol (SIP). Other media channel establishingprotocols may also be adopted, which are not limited by the embodimentsof the present disclosure.

130: Perform channel combination on the first video channel and thesecond video channels to establish a multi-party video call between thefirst dialing device and multiple IP terminal devices.

It should understood that, for performing channel combination on thefirst video channel and the second video channels, data transmittedthrough the first video channel and data transmitted through thecorresponding second video channels of the first IP terminal device andthe second IP terminal device can be monitored. The data of these threevideo channels is processed and combined in real time, and then sent tothe device corresponding to each video channel. It is also possible toestablish a video channel between the dialing device and the first IPterminal device based on the first video channel and the second videochannel of the first IP terminal device, and establish a video channelbetween the dialing device and the second IP terminal device based onthe first video channel and the second video channel of the second IPterminal device. These two video channels may then be monitored forperforming channel combination.

In the present disclosure, since the video server has an IP networkconnection with the telephone network node, a video channel with thedialing device located in the same telephone network of the telephonenetwork node is realized, which realizes video data transmission betweenthe IP network and the telephone network. In addition, since therespective video channels of multiple IP terminal devices can beconfigured in the IP network, a multi-party video call between themultiple IP terminal devices and the dialing device is realized, whichimproves the performance of the video service capabilities of the IPnetwork and the telephone network, and saves the communication cost.

In other words, the communication between the dialing device and thetelephone network node is based on a video telephone protocol (aprotocol in 5G telephone network). The communication between thetelephone network node and the video server is based on the IP network.The communication between the video server and the IP terminal device isbased on the IP network. Specifically, the telephone network nodereceives a video call from the dialing device based on a video telephoneprotocol, and forwards the video call to the video server based on theIP network. In response to the video call, the video server sends asession request to the telephone network node based on the IP network.The telephone network node receives a session request based on the IPnetwork and forwards the session request to the dialing device based onthe video telephone protocol. In response to the session request, thedialing device sends a session confirmation to the telephone networknode based on the video telephone protocol. The telephone network nodeforwards the session confirmation based on the IP network.

In some examples, the first video channel with the first dialing devicemay be established after the second video channels are established withthe first IP terminal device and the second IP terminal device. It isalso possible to establish the second video channels with the first IPterminal device and the second IP terminal device after the first videochannel with the first dialing device is established.

In other examples, the second video channels with the first IP terminaldevice and the second IP terminal device can be establishedsimultaneously in response to a video call request. It is also possibleto establish a video call with the first IP terminal device, and thenadd the second IP terminal device to the video call. For example, whenconducting a video call based on a first media channel of the firstdialing device and a second media channel of the first IP terminaldevice, the method may further include: in response to the video call,notifying the first dialing device and the first IP terminal device thatthe second IP terminal device is about to join. For example, anidentification of the second IP terminal device is sent to the firstdialing device and the first IP terminal device, and respective localvideo processing modules of the first dialing device and the first IPterminal device can respond to the identification and adjust respectivevideo windows, etc. In response to the identification, the first dialingdevice and the first IP terminal device can monitor respective localnetwork environments, and send reports to the video server. The videoserver can send a session request to the second IP terminal device whenboth the first dialing device and the first IP terminal device satisfyconditions for a multi-party video call, to establish a second videochannel with the second IP terminal device.

In other examples, the same communication method can also be used whenthe second dialing device joins the above-mentioned multi-party videocall, which is not repeated in the examples of the present disclosure.

In implementations, determining the first video channel with the firstdialing device in response to the first video call includes: sending asession request to the telephone network node based on the IP networkconnection in response to the first video call, the session requestincluding an identification of the video server; and receiving a sessionconfirmation sent by the telephone network node based on the IP networkconnection, and establishing the first video channel, wherein thesession confirmation includes an identification of the first dialingdevice.

Since the first video channel is established through the session requestin response to the first video call, establishing the first videochannel is realized through fast and convenient interactions.

In implementations, performing the channel combination on the firstvideo channel and the second video channel includes: obtaining videocall data of the first dialing device, video call data of the first IPterminal device and the second IP terminal device based on the firstvideo channel of the first dialing device and the second video channelsof the first IP terminal device and the second IP terminal devicerespectively; generating real-time combined call data according to thevideo call data of the first dialing device, the first IP terminaldevice and the second IP terminal device; and sending the real-timecombined call data to the first dialing device, the first IP terminaldevice and the second IP terminal device based on the first videochannel of the first dialing device and the second video channels of thefirst IP terminal device and the second IP terminal device respectively.

Since the video data obtained from various video channels can becombined and processed to realize the integration of the various videochannels, it is not necessary to improve the network architecture forthe multi-party call, and still the multi-party video call can berealized.

In implementations, performing the channel combination on the firstvideo channel and the second video channel includes: obtaining videocall data of the first dialing device, video call data of the first IPterminal device and the second IP terminal device based on the firstvideo channel of the first dialing device and the second video channelsof the first IP terminal device and the second IP terminal devicerespectively; and recording the video call data of the first dialingdevice, the first IP terminal device and the second IP terminal devicein real time as a target video file, and locally saving the target videofile.

The solution of this example can record the video call data of the firstdialing device, the first IP terminal device, and the second IP terminaldevice as a target video file in real time, and save the target videofile locally, thus realizing recording services under video customerservice scenarios, for example.

In implementations, determining the second video channels with the firstIP terminal device and the second IP terminal device in response to thefirst video call includes: querying the current service status of eachagent terminal device in the IP network in response to the first videocall, the current service status indicating being currently available orcurrently occupied; and determining the first IP terminal device and thesecond IP terminal device from the agent terminal devices whose currentservice status indicates currently available.

Since the first IP terminal device and the second IP terminal device arecurrently available devices determined from various agent terminaldevices, services under multi-agent customer service scenarios, forexample, are implemented. For example, in a multi-agent customer servicescenario, the first IP terminal device may be a technical service agent,and the second IP terminal device may be a customer service agent,thereby improving the service quality and efficiency of the customerservice.

In implementations, the method further includes: using the web socketprotocol to encapsulate a SIP session request; sending the SIP sessionrequest to respective browser clients installed on the first IP terminaldevice and the second IP terminal device via the IP network; andreceiving session confirmations sent by the respective browser clientsbased on the web socket protocol via the IP network, and establishingthe second video channels of the first IP terminal device and the secondIP terminal device.

It should be understood that WebSocket is a protocol for full-duplexcommunication over a single TCP connection. Since SIP session requestsand SIP session confirmations can reliably and efficiently establishvideo transmission channels in the IP network, and the web socketprotocol is compatible with the software framework of browser clients,the second video channels are quickly established.

In implementations, the method further includes: obtaining a userinquiry of the first dialing device though the first video channel;identifying the user inquiry, and determining a customer service replycorresponding to the user inquiry; and returning the customer servicereply to the first dialing device though the first video channel.

Since the video server can identify user inquiries and combine videochannels, both the video data processing efficiency and the datacommunication efficiency are taken into account.

In implementations, the method further includes: determining a thirdvideo channel with a second dialing device in response to a second videocall, wherein the second dialing device and the telephone network nodeare located in the telephone network; and performing channel combinationon the first video channel, the second video channel, and the thirdvideo channel, to add the second dialing device to the multi-party videocall.

Since channel combination can be performed on the first video channel,the second video channel and the third video channel, adding a newcommunication device in the multi-party video communication is realized.

FIG. 2A is a schematic interaction diagram of a communication methodaccording to another embodiment of the present disclosure. Theinteraction diagram of this example involves an interaction processamong a dialing device, a network node configured with an IMS telephone,a video server, an agent terminal A as a first IP terminal device, andan agent terminal B as a second IP terminal device.

It should be understood that the method of this example may include moreor fewer steps, and may also include alternative steps, which is notlimited in this example.

It also should be understood that the IMS-configured telephone networknode may act as a telephone network gateway connected to the IP network.In other words, the dialing device (e.g., the first dialing devicedescribed above) may be located in a telephone network with thetelephone network node. The telephone network node may be located in theIP network with the video server, the agent terminal A, and the agentterminal B. the telephone network node and the video server have an IPnetwork connection, and the telephone network node has an IP networkconnection with at least one of the agent terminal A and the agentterminal B, or may have no IP network connection with both of them.

In step 2101, the dialing device initiates a video call to the telephonenetwork node.

In step 2102, the telephone network node forwards the video call to thevideo server.

In step 2103, the video server sends a first session request to thetelephone network node.

In step 2104, the video server sends a second session request to theagent terminal A.

In step 2105, the telephone network node forwards the first sessionrequest to the dialing device.

In step 2106, the agent terminal A sends a second session confirmationto the video server.

In step 2107, the video server sends a second session request to theagent terminal B.

In step 2108, the dialing device sends a first session confirmation tothe telephone network node.

In step 2109, the telephone network node sends a first sessionconfirmation to the video server. Specifically, the first sessionconfirmation is a response to the first session request.

In step 2110, the agent terminal B sends a second session confirmationto the video server. Specifically, the second session confirmation is aresponse to the second session request.

In step 2111, a first video channel between the dialing device and thevideo server is established. Specifically, the first video channel maybe established in response to receiving the first session confirmation.

In step 2112, respective second video channels of the video server withthe agent terminal A and the agent terminal B are established.Specifically, the second video channels may be established in responseto receiving the second session confirmations.

In step 2113, channel combination is performed on the first videochannel and the second video channels to establish a multi-party videocall. Specifically, after the video channel established later in timeamong the first video channel and the second video channels isestablished, the video server can monitor transmitted data in the firstvideo channel and the second video channels, and generate real-timecombined video data, such as encapsulating real-time combined video datainto a sequence of packets using a specific data encapsulation format.

FIG. 2B is a schematic interaction diagram of a communication methodaccording to another embodiment of the present disclosure. Theinteraction diagram of this example involves an interaction processamong a dialing device, a network node configured with an IMS telephone,a video server, an agent terminal A as a first IP terminal device, andan agent terminal B as a second IP terminal device.

It should be understood that the method of this example may include moreor fewer steps, and may also include alternative steps, which is notlimited in this example.

It also should be understood that the IMS-configured telephone networknode may act as a telephone network gateway connected to an IP network.In other words, the dialing device (e.g., the first dialing devicedescribed above) may be located in a telephone network with thetelephone network node. The telephone network node may be located in theIP network with the video server, the agent terminal A, and the agentterminal B. The telephone network node and the video server have an IPnetwork connection, and the telephone network node has an IP networkconnection with at least one of the agent terminal A and the agentterminal B, or may have no IP network connection with both of them.

In the previous example, the first video channel and the second videochannels are established at the same time. In response to a video call,a first session request is sent to the telephone network node. Inresponse to the video call, second session request(s) is/are sent to atleast one of the agent terminal A and the agent terminal B.

In this example, the second video channels are completed before thevideo call is initiated.

In step 2101, the telephone network node forwards the video call to thevideo server.

In step 2102, the agent terminal A sends a second session confirmationto the video server.

In step 2103, the video server sends a second session request to theagent terminal B.

In step 2114, the agent terminal B sends a second session confirmationto the video server.

In step 2115, respective second video channels of the video server withthe agent terminal A and the agent terminal B are established.

In step 2106, the dialing device initiates a video call to the telephonenetwork node.

In step 2107, the video server sends a first session request to thetelephone network node.

In step 2108, the video server sends a second session request to theagent terminal A.

In step 2109, the telephone network node forwards the first sessionrequest to the dialing device.

In step 2110, the dialing device sends a first session confirmation tothe telephone network node.

In step 2111, the telephone network node sends a first sessionconfirmation to the video server.

In step 2112, a first video channel between the dialing device and thevideo server is established.

In step 2113, channel combination is performed on the first videochannel and the second video channels to establish a multi-party videocall.

FIG. 3 is a schematic flowchart of a communication method according toanother embodiment of the present disclosure. The communication methodof FIG. 3 is applied to a video server having an IP network connectionwith a telephone network node. It should be understood that the videoserver may include, but is not limited to, a server with videoprocessing capability, a server cluster, a cloud server such as adedicated cloud, a private cloud, and a public cloud, etc.

The method includes the following steps:

310: Obtain the video call sent by an IP terminal device, the IPterminal device and the video server being located in the IP network.

320: Determine first media channels with a first dialing device and asecond dialing device and a second media channel with the IP terminaldevice in response to the video call, wherein the first dialing deviceand the second dialing device are located within the telephone network.

330: Perform channel combination on the first media channels and thesecond media channel to establish a multi-party video call between theIP terminal device, the first dialing device, and the second dialingdevice.

In the present disclosure, since the video server has an IP networkconnection with the telephone network node, a video channel with thedialing device located in the same telephone network of the telephonenetwork node is realized, which realizes video data transmission betweenthe IP network and the telephone network. In addition, since therespective video channels of multiple IP terminal devices can beconfigured in the IP network, a multi-party video call between themultiple IP terminal devices and the dialing device is realized, whichimproves the performance of the video service capabilities of the IPnetwork and the telephone network, and saves the communication cost.

In other words, the communication between the dialing device and thetelephone network node is based on a video telephone protocol (aprotocol in 5G telephone network). The communication between thetelephone network node and the video server is based on the IP network.The communication between the video server and the IP terminal device isbased on the IP network. Specifically, the telephone network nodereceives a video call from the dialing device based on a video telephoneprotocol, and forwards the video call to the video server based on theIP network. In response to the video call, the video server sends asession request to the telephone network node based on the IP network.The telephone network node receives a session request based on the IPnetwork and forwards the session request to the dialing device based onthe video telephone protocol. In response to the session request, thedialing device sends a session confirmation to the telephone networknode based on the video telephone protocol. The telephone network nodeforwards the session confirmation based on the IP network.

It should be understood that the IMS system in each embodiment of thepresent disclosure may be deployed in any network node of the 5G corenetwork.

In some examples, first media channels with the first dialing device andthe second dialing device are determined in response to a video call.The identification of the second dialing device is included in the videocall, and the video server may have a first media channel with the firstdialing device. According to the identification of the second dialingdevice, a first media channel between the video server and the seconddialing device is established (e.g., a first video channel) to enable amulti-party video call. For example, when a video call is conductedbased on the first media channel of the first dialing device and thesecond media channel, the method further includes: notifying the firstdialing device and the IP terminal device that the second dialing deviceis about to join in response to the video call. For example, theidentification of the second dialing device is sent to the first dialingdevice and the IP terminal device, and local video processing modules ofthe first dialing device and the IP terminal device can respond to theidentification and adjust corresponding video windows. The first dialingdevice and the IP terminal device can monitor respective local networkenvironments in response to the identification, and send reports to thevideo server. The video server can send a session request to the seconddialing device when both the first dialing device and the first IPterminal device satisfy conditions for a multi-party video call.

In other examples, the video call includes respective identifications ofthe first dialing device and the second dialing device. The video servercan establish first media channels with the first dialing device and thesecond dialing device to realize a multi-party video call based on theidentifications of the first dialing device and the second dialingdevice.

In other examples, a second media channel with the IP terminal device isdetermined in response to the video call. A second media channel withthe IP terminal device can be established in response to the video call.

FIG. 4 is a schematic flowchart of a communication method according toanother embodiment of the present disclosure. The communication methodof FIG. 4 is applied to a telephone network node having an IP networkconnection with a video server.

A telephone network node may be configured with an IP MultimediaSubsystem (IMS) to help establish a media channel capable of videocommunication with the IP network. Messages (e.g., video calls, etc.)received by the telephone network node from a dialing device are incompliance with the communication protocol (e.g., statutory standardprotocol or de facto standard protocol) of the telephone operatordeploying the telephone network. The telephone operator provides IPnetwork traffic services and IP video telephone services over thetelephone network. Initiating a video call corresponds to an IP videotelephone service.

The method includes the following steps:

410: Forward a received video call to the video server based on the IPnetwork connection, wherein a dialing device that initiates the videocall and the telephone network node are located in the telephonenetwork.

420: Receive a session request sent by the video server in response tothe video call based on the IP network connection, and forward thesession request to the dialing device.

430: Send a session confirmation received from the dialing device to thevideo server based on the IP network connection to establish a videochannel between the dialing device and the video server.

440: Forward video call data between the dialing device and the videoserver based on the video channel.

In the present disclosure, since the video server has an IP networkconnection with the telephone network node, a video channel with thedialing device located in the same telephone network of the telephonenetwork node is realized, which realizes video data transmission betweenthe IP network and the telephone network. In addition, since therespective video channels of multiple IP terminal devices can beconfigured in the IP network, a multi-party video call between themultiple IP terminal devices and the dialing device is realized, whichimproves the performance of the video service capabilities of the IPnetwork and the telephone network, and saves the communication cost.

FIG. 5 is a schematic flowchart of a communication method according toanother embodiment of the present disclosure. The communication methodof FIG. 5 is applied to a dialing device, and includes the followingsteps:

510: Send a video call to a telephone network node.

520: Receive a session request sent by the telephone network node inresponse to the video call, wherein the session request includes anidentification of a video server.

530: Establish a video channel with the video server in response to thesession request.

540: Send video call data to the telephone network node or receive videocall data from the telephone network node based on the video channel.

In the present disclosure, since the video server has an IP networkconnection with the telephone network node, a video channel with thedialing device located in the same telephone network of the telephonenetwork node is realized, which realizes video data transmission betweenthe IP network and the telephone network. In addition, since therespective video channels of multiple IP terminal devices can beconfigured in the IP network, a multi-party video call between themultiple IP terminal devices and the dialing device is realized, whichimproves the performance of the video service capabilities of the IPnetwork and the telephone network, and saves the communication cost.

Apparatuses corresponding to the above method embodiments will bedescribed below with reference to various embodiments of FIGS. 6-9 .

FIG. 6 is a schematic block diagram of a communication apparatusaccording to another embodiment of the present disclosure. Thecommunication apparatus in FIG. 6 is applied to a video server that hasan IP network connection with a telephone network node. The apparatusincludes:

a receiving module 610 that receives a first video call sent by atelephone network node based on an IP network connection, the video callincluding an identification of a first dialing device;

a determination module 620 that determines a first video channel withthe first dialing device and second video channels with a first IPterminal device and a second IP terminal device in response to the firstvideo call, wherein the first dialing device and the telephone networknode are located in a telephone network, and the first IP terminaldevice, the second IP terminal device and the video server are locatedin an IP network; and

a combination module 630 that performs channel combination on the firstvideo channel and the second video channel to establish a multi-partyvideo call between the first dialing device and multiple IP terminaldevices.

In the present disclosure, since the video server has an IP networkconnection with the telephone network node, a video channel with thedialing device located in the same telephone network of the telephonenetwork node is realized, which realizes video data transmission betweenthe IP network and the telephone network. In addition, since therespective video channels of multiple IP terminal devices can beconfigured in the IP network, a multi-party video call between themultiple IP terminal devices and the dialing device is realized, whichimproves the performance of the video service capabilities of the IPnetwork and the telephone network, and saves the communication cost.

In implementations, the determination module is specifically configuredto: send a session request to the telephone network node based on the IPnetwork connection in response to the first video call, the sessionrequest including an identification of the video server; and receiving asession confirmation based on the IP network connection and establishingthe first video channel, wherein the session confirmation includes anidentification of the first dialing device.

In implementations, the combination module is specifically configured toobtain video call data of the first dialing device and video call dataof the first IP terminal device and the second IP terminal device basedon the first video channel of the first dialing device and the secondvideo channels of the first IP terminal device and the second IPterminal device respectively; generate real-time combined call dataaccording to the video call data of the first dialing device, the firstIP terminal device and the second IP terminal device; and send thereal-time combined call data to the first dialing device, the first IPterminal device, and the second IP terminal device based on the firstvideo channel of the first dialing device and the second video channelsof the first IP terminal device and the second IP terminal devicerespectively.

In implementations, the combination module is specifically configured toobtain video call data of the first dialing device and video call dataof the first IP terminal device and the second IP terminal device basedon the first video channel of the first dialing device and the secondvideo channels of the first IP terminal device and the second IPterminal device respectively; and record the video call data of thefirst dialing device, the first IP terminal device and the second IPterminal device as a target video file in real time, and locally storethe target video file.

In implementations, the determination module is specifically configuredto: query a current service status of each agent terminal device in theIP network in response to the first video call, the current servicestatus indicating being currently available or currently occupied; anddetermine the first IP terminal device and the second IP terminal devicefrom among agent terminal devices whose status indicates being currentlyavailable.

In implementations, the apparatus further includes: a requestencapsulation module that encapsulates a SIP session request using a websocket protocol; a request sending module that sends the SIP sessionrequest to respective installed browser clients of the first IP terminaldevice and the second IP terminal devices; a channel establishing modulethat receives session confirmations sent by the respective browserclients based on the web socket protocol within the IP network, andestablishes the second video channels with the first IP terminal deviceand the second IP terminal device.

In implementations, the apparatus further includes: an inquiryacquisition module that obtains a user inquiry of the first dialingdevice through the first video channel; an identification module thatidentifies the user inquiry, and determines a customer service replycorresponding to the user inquiry; and a reply returning module thatreturns the customer service reply to the first dialing device throughthe first video channel.

In implementations, the apparatus further includes: a channeldetermination module that determines a third video channel with a seconddialing device in response to a second video call, wherein the seconddialing device and the telephone network node are located within thetelephone network; and the combination module is further configured toperform channel combination on the first video channel, the second videochannel and the third video channel to add the second dialing device tothe multi-party video call.

The apparatus of this embodiment is used to implement correspondingmethods in the foregoing method embodiments, and has the beneficialeffects of the corresponding method embodiments, which will not berepeated herein. In addition, for the functional implementations of eachmodule in the apparatus of this embodiment, reference may be made to thedescription of the corresponding parts in the foregoing methodembodiments, and details thereof are not repeated herein.

FIG. 7 is a schematic block diagram of a communication apparatusaccording to another embodiment of the present disclosure. Thecommunication apparatus of FIG. 7 is applied to a video server having anIP network connection with a telephone network node, and the apparatusincludes:

an acquisition module 710 that obtains a video call sent by an IPterminal device, the IP terminal device and the video server beinglocated in an IP network;

a determination module 720 that determines first media channels with afirst dialing device and a second dialing device and a second mediachannel with the IP terminal device in response to the video call,wherein the first dialing device and the second dialing device arelocated within a telephone network; and

a combination module 730 that performs channel combination on the firstmedia channel and the second media channels to establish a multi-partyvideo call among the IP terminal device, the first dialing device andthe second dialing device.

In the present disclosure, since the video server has an IP networkconnection with the telephone network node, a video channel with thedialing device located in the same telephone network of the telephonenetwork node is realized, which realizes video data transmission betweenthe IP network and the telephone network. In addition, since therespective video channels of multiple IP terminal devices can beconfigured in the IP network, a multi-party video call between themultiple IP terminal devices and the dialing device is realized, whichimproves the performance of the video service capabilities of the IPnetwork and the telephone network, and saves the communication cost.

The apparatus of this embodiment is used to implement correspondingmethods in the foregoing method embodiments, and has the beneficialeffects of the corresponding method embodiments, which will not berepeated herein. In addition, for the functional implementations of eachmodule in the apparatus of this embodiment, reference may be made to thedescription of the corresponding parts in the foregoing methodembodiments, and details thereof are not repeated herein.

FIG. 8 is a schematic block diagram of a communication apparatusaccording to another embodiment of the present disclosure. Thecommunication apparatus of FIG. 8 is applied to a telephone network nodehaving an IP network connection with a video server, and the apparatusincludes:

a first transceiver module 810 that forwards a received video call tothe video server based on the IP network connection, wherein a dialingdevice that initiates the video call and the telephone network node arelocated in a telephone network;

a second transceiver module 820 that receives a session request sent bythe video server in response to the video call based on the IP networkconnection, and forwards the session request to the dialing device;

a third transceiver module 830 that sends the session confirmationreceived from the dialing device to the video server based on the IPnetwork connection to establish a video channel between the dialingdevice and the video server; and

a fourth transceiver module 840 that forwards video call data betweenthe dialing device and the video server based on the video channel.

In the present disclosure, since the video server has an IP networkconnection with the telephone network node, a video channel with thedialing device located in the same telephone network of the telephonenetwork node is realized, which realizes video data transmission betweenthe IP network and the telephone network. In addition, since therespective video channels of multiple IP terminal devices can beconfigured in the IP network, a multi-party video call between themultiple IP terminal devices and the dialing device is realized, whichimproves the performance of the video service capabilities of the IPnetwork and the telephone network, and saves the communication cost.

The apparatus of this embodiment is used to implement correspondingmethods in the foregoing method embodiments, and has the beneficialeffects of the corresponding method embodiments, which will not berepeated herein. In addition, for the functional implementations of eachmodule in the apparatus of this embodiment, reference may be made to thedescription of the corresponding parts in the foregoing methodembodiments, and details thereof are not repeated herein.

FIG. 9 is a schematic block diagram of a communication apparatusaccording to another embodiment of the present disclosure. Thecommunication apparatus of FIG. 9 is applied to dialing device, and theapparatus includes:

a first transceiver module 910 that sends a video call to a telephonenetwork node;

a second transceiver module 920 that receives a session request sent bythe telephone network node in response to the video call, wherein thesession request includes an identification of a video server;

an establishing module 930 that establishes a video channel with thevideo server in response to the session request; and

a third transceiver module 940 that sends the video call data to thetelephone network node or receives video call data from the telephonenetwork node based on the video channel.

In the present disclosure, since the video server has an IP networkconnection with the telephone network node, a video channel with thedialing device located in the same telephone network of the telephonenetwork node is realized, which realizes video data transmission betweenthe IP network and the telephone network. In addition, since therespective video channels of multiple IP terminal devices can beconfigured in the IP network, a multi-party video call between themultiple IP terminal devices and the dialing device is realized, whichimproves the performance of the video service capabilities of the IPnetwork and the telephone network, and saves the communication cost.

The apparatus of this embodiment is used to implement correspondingmethods in the foregoing method embodiments, and has the beneficialeffects of the corresponding method embodiments, which will not berepeated herein. In addition, for the functional implementations of eachmodule in the apparatus of this embodiment, reference may be made to thedescription of the corresponding parts in the foregoing methodembodiments, and details thereof are not repeated herein.

FIG. 10 is a hardware structure of an electronic device according toanother embodiment of the present disclosure. In implementations, theapparatuses, the video server, the telephone network node, the IPterminal devices (such as the first IP terminal device, the second IPterminal device, etc.), the dialing devices (such as the first dialingdevice, the second dialing device, etc.) as described in the foregoingdescription may each include one or more electronic devices. As shown inFIG. 10 , the hardware structure of the electronic device may include: aprocessor 1001, a communication interface 1002, a memory 1003, and acommunication bus 1004.

The processor 1001, the communication interface 1002, and the memory1003 communicate with each other through the communication bus 1004.

Optionally, the communication interface 1002 may be an interface of acommunication module.

The processor 1001 may be specifically configured to: receive a firstvideo call sent by a telephone network node based on an IP networkconnection, wherein the video call includes an identification of a firstdialing device; determine a first video channel with the first dialingdevice and second video channels with a first IP terminal device and asecond IP terminal device in response to the first video call, whereinthe first dialing device and the telephone network node are located in atelephone network, and the first IP terminal device, the second IPterminal device and the video server are located in an IP network; andperform channel combination on the first video channel and the secondvideo channels to establish a multi-party video call between the firstdialing device and multiple IP terminal devices.

Alternatively, the processor 1001 may be specifically configured to:obtain a video call sent by an IP terminal device, wherein the IPterminal device and a video server are located in an IP network;determine first media channels with a first dialing device and a seconddialing device in response to the video call, and a second media channelwith the IP terminal device, wherein the first dialing device and thesecond dialing device are located in a telephone network; performchannel combination on the first media channels and the second mediachannel to establish a multi-party video call among the IP terminaldevice, the first dialing device and the second dialing device.

Alternatively, the processor 1001 may be specifically configured to:forward a received video call to a video server based on an IP networkconnection, wherein a dialing device that initiates the video call and atelephone network node are located in a telephone network; receiving asession request sent by the video server based on the IP networkconnection in response to the video call, and forward the sessionrequest to the dialing device; send a session confirmation received fromthe dialing device to the video server based on the IP networkconnection to establish a video channel between the dialing device andthe video server; and forward video call data between the dialing deviceand the video server based on the video channel.

Alternatively, the processor 1001 may be specifically configured to:send a video call to a telephone network node; receive a session requestsent by the telephone network node in response to the video call,wherein the session request includes an identification of a videoserver; and send video call data to the telephone network node orreceive video call data from the telephone network node based on thevideo channel.

The above-mentioned processor may be a general-purpose processor, whichincludes a central processing unit (abbreviated as CPU), a networkprocessor (abbreviated as NP), etc.; and may also be a digital signalprocessor (DSP), an application-specific integrated circuit (ASIC),field programmable gate array (FPGA) or other programmable logicdevices, discrete gates or transistor logic devices, discrete hardwarecomponents. The methods, steps, and logical block diagrams disclosed inthe embodiments of the present disclosure can be implemented orexecuted. A general purpose processor may be a microprocessor or theprocessor may be any conventional processor or the like.

The above-mentioned memory may be, but is not limited to, random accessmemory (RAM), read only memory (ROM), programmable read-only memory(PROM), erasable programmable read-only memory (EPROM), electricalerasable programmable read-only memory (EEPROM), etc. Inimplementations, the memory may include one or more program modules ofthe apparatuses as described in the foregoing description.

In particular, according to the embodiments of the present disclosure,the processes described above with reference to the flowcharts may beimplemented as computer software programs. For example, the embodimentsof the present disclosure include a computer program product, whichincludes a computer program carried on a storage medium. The computerprogram includes program codes configured to perform the methods shownin the flowcharts. In these embodiments, the computer program may bedownloaded and installed from a network via a communication component,and/or installed from a removable medium. When this computer program isexecuted by a central processing unit (CPU), the above-mentionedfunctions defined in the methods of the present disclosure areperformed. It should be noted that the storage medium described in thepresent disclosure may be a computer readable signal medium or acomputer readable storage medium, or any combination thereof. Thestorage medium may be, for example, but not limited to, an electrical,magnetic, optical, electromagnetic, infrared, or semiconductor system,apparatus or device, or a combination of any of the above. More specificexamples of computer readable storage media may include, but are notlimited to, electrical connections with one or more wires, portablecomputer disks, hard disks, random access memory (RAM), read only memory(ROM), erasable programmable read-only memory (EPROM or flash memory),optical fibers, portable compact disk read only memory (CD-ROM), opticalstorage media devices, magnetic storage media devices, or any suitablecombination of the above. In the present disclosure, a computer readablestorage medium may be any tangible medium that includes or stores aprogram that can be used by or used in conjunction with an instructionexecution system, apparatus, or device. As defined herein, the computerreadable storage medium does not include transitory media, such asmodulated data signals and carrier waves.

In the present disclosure, however, a computer readable signal mediummay include a data signal propagated in baseband or as a part of acarrier wave, which carries computer readable program codes therein.Such propagated data signal may take a variety of forms, which include,but are not limited to electromagnetic signals, optical signals, or anysuitable combination of the above. A computer readable signal medium canalso be any storage medium other than a computer readable storagemedium. Such storage medium can transmit, propagate, or transport aprogram that can be used by or used in conjunction with an instructionexecution system, apparatus, or device. Program codes embodied on thestorage medium may be transmitted using any suitable medium including,but not limited to, wireless, electrical wires, optical fiber cables,RF, etc., or any suitable combination of the above.

Computer program codes configured to perform operations of the presentdisclosure may be written in one or more programming languages, whichinclude object-oriented programming languages—such as Java, Smalltalk,C++, but also conventional procedural programming languages—such as “C”language or similar programming languages. The program codes may executeentirely on a user's computer, or partly on the user's computer, or as astand-alone software package, partly on the user's computer and partlyon a remote computer, or entirely on the remote computer or server. Whena remote computer is involved, the remote computer may be connected tothe user's computer through any kind of network, which includes a localarea network (LAN) or a wide area network (WAN), or may be connected toan external computer (e.g., using an Internet service provider toconnect through the Internet).

The flowcharts and block diagrams in the accompanying drawingsillustrate the architectures, functionalities, and operations ofpossible implementations of systems, methods and computer programproducts according to various embodiments of the present disclosure. Inthis regard, each block in a flowchart or block diagram may represent amodule, a segment, or a portion of code that includes one or moreexecutable instructions configured to implement specified logicalfunctions. There are specific ordering relationships in the abovespecific embodiments, but these ordering relationships are onlyexemplary. When being implemented in practice, these steps may be fewer,more, or an order of execution may be adjusted. In other words, in somealternative implementations, the functions noted in the blocks may occurout of order as indicated in the associated figure. For example, twosuccessive shown blocks may, in practice, be executed substantiallyconcurrently, or the blocks may sometimes be executed in a reverseorder, depending on the functionalities that are involved. It is alsonoted that each block of a block diagram and/or flowchart, and acombination of blocks in the block diagram and/or flowchart, may beimplemented in a dedicated hardware-based system that performs specifiedfunctions or operations, or can be implemented in a combination ofdedicated hardware and computer instructions.

The modules involved in the embodiments of the present disclosure may beimplemented in the form of software, and may also be implemented in theform of hardware. Under certain conditions, the names of these modulesin no way impose any limitation on the modules.

As another aspect, the present disclosure also provides a storage mediumon which a computer program is stored. When the program is executed by aprocessor, the method(s) described in the above embodiments is/areimplemented.

As another aspect, the present disclosure also provides a storagemedium, which may be included in the apparatus that is described in theabove embodiments, or may exist alone without being assembled into thatapparatus. The storage medium carries one or more programs. When beingexecuted by the apparatus, the one or more programs cause the apparatusto implement the methods of the above-mentioned embodiments.

The expressions “first,” “second,” “the first,” or “the second,” as usedin various embodiments of the present disclosure, may modify variouselements regardless of order and/or importance, but these expressions donot limit the corresponding parts. The above expressions are only usedfor the purpose of distinguishing an element from other elements. Forexample, a first user device and a second user device representdifferent user devices, though both are user devices. For example, afirst element could be called as a second element, and, similarly, asecond element could be called as a first element, without departingfrom the scope of the present disclosure.

When an element (e.g., a first element) is referred to as being“(operatively or communicatively) coupled with” another element (e.g., asecond element) or “(operatively or communicatively) coupled to” anotherelement (e.g., a second element), or “connected to” another element(e.g., a second element), it should be understood that such element isdirectly connected to the other element or such element is indirectlyconnected to the other element through yet another element (e.g., athird element). In contrast, it can be understood that when an element(e.g., a first element) is referred to as being “directly connected” or“directly coupled” to another element (e.g., a second element), noelement (e.g., a third element) is interposed between the two.

The above merely describes exemplary embodiments of the presentdisclosure and an illustration of the technical principles that areapplied. It should be understood by one skilled in the art that theinventive scope involved in the present disclosure is not limited to thetechnical solutions formed by the specific combinations of the abovetechnical features, and shall also cover the above technical features orother technical solutions formed by any combination of their equivalentfeatures, without departing from the above inventive concept. Forexample, a technical solution is formed by replacing the above-mentionedfeatures with (but not limited to) technical features having functionssimilar to what are disclosed in the present disclosure.

What is claimed is:
 1. A method implemented by a video server, themethod comprising: receiving a first video call sent by a telephonenetwork node based on an Internet Protocol (IP) network connection withthe video server, the first video call including an identification of afirst dialing device; determining a first video channel with the firstdialing device, and second video channels with a first IP terminaldevice and a second IP terminal device in response to the first videocall, wherein the first dialing device and the telephone network nodeare located in a telephone network, and the first IP terminal device,the second IP terminal device and the video server are located in an IPnetwork, wherein determining the second video channels with the first IPterminal device and the second IP terminal device in response to thefirst video call comprises: querying a current service status of eachagent terminal device in the IP network in response to the first videocall, wherein the current service status indicates being currentlyavailable or currently occupied; and determining the first IP terminaldevice and the second IP terminal device from among agent terminaldevices whose current service status indicates being currentlyavailable; and performing channel combination on the first video channeland the second video channels to establish a multi-party video callbetween the first dialing device and multiple IP terminal devices. 2.The method according to claim 1, wherein determining the first videochannel with the first dialing device in response to the first videocall comprises: sending a session request to the telephone network nodebased on the IP network connection in response to the first video call,where the session request including an identification of the videoserver; and receiving a session confirmation sent by the telephonenetwork node based on the IP network connection, and establishing thefirst video channel, wherein the session confirmation includes theidentification of the first dialing device.
 3. The method according toclaim 2, wherein performing the channel combination on the first videochannel and the second video channels comprises: obtaining video calldata of the first dialing device and video call data of the first IPterminal device and the second IP terminal device based on the firstvideo channel of the first dialing device and the second video channelsof the first IP terminal device and the second IP terminal devicerespectively; generating real-time combined call data according to thevideo call data of the first dialing device, the first IP terminaldevice and the second IP terminal device; and sending the real-timecombined call data to the first dialing device, the first IP terminaldevice and the second IP terminal device based on the first videochannel of the first dialing device and the second video channels of thefirst IP terminal device and the second IP terminal device respectively.4. The method according to claim 2, wherein performing the channelcombination on the first video channel and the second video channelscomprises: obtaining video call data of the first dialing device andvideo call data of the first IP terminal device and the second IPterminal device based on the first video channel of the first dialingdevice and the second video channels of the first IP terminal device andthe second IP terminal device respectively; and recording the video calldata of the first dialing device, the first IP terminal device and thesecond IP terminal device in real time as a target video file, andlocally storing the target video file.
 5. The method according to claim1, further comprising: encapsulating a SIP session request using a websocket protocol; sending the SIP session request to respective browserclients installed on the first IP terminal device and the second IPterminal device within the IP network; and receiving sessionconfirmations sent by the respective browser clients based on the websocket protocol within the IP network, and establishing the second videochannels with the first IP terminal device and the second IP terminaldevice.
 6. The method according to claim 1, further comprising:obtaining a user inquiry of the first dialing device through the firstvideo channel; identifying the user inquiry, and determining a customerservice response corresponding to the user inquiry; returning thecustomer service response to the first dialing device through the firstvideo channel.
 7. The method according to claim 1, further comprising:determining a third video channel with a second dialing device inresponse to a second video call, wherein the second dialing device andthe telephone network node are located within the telephone network; andperforming the channel combination on the first video channel, thesecond video channels and the third video channel to add the seconddialing device to the multi-party video call.
 8. One or more computerreadable storage media storing executable instructions that, whenexecuted by one or more processors of a video server, cause the one ormore processors to perform acts comprising: receiving a first video callsent by a telephone network node based on an Internet Protocol (IP)network connection with the video server, the first video call includingan identification of a first dialing device; determining a first videochannel with the first dialing device, and second video channels with afirst IP terminal device and a second IP terminal device in response tothe first video call, wherein the first dialing device and the telephonenetwork node are located in a telephone network, and the first IPterminal device, the second IP terminal device and the video server arelocated in an IP network; performing channel combination on the firstvideo channel and the second video channels to establish a multi-partyvideo call between the first dialing device and multiple IP terminaldevices; encapsulating a SIP session request using a web socketprotocol; sending the SIP session request to respectively browserclients installed on the first IP terminal device and the second IPterminal device within the IP network; and receiving sessionconfirmations sent by the respective browser clients based on the websocket protocol within the IP network, and establishing the second videochannels with the first IP terminal device and the second IP terminaldevice.
 9. The one or more computer readable storage media according toclaim 8, wherein determining the first video channel with the firstdialing device in response to the first video call comprises: sending asession request to the telephone network node based on the IP networkconnection in response to the first video call, where the sessionrequest including an identification of the video server; and receiving asession confirmation sent by the telephone network node based on theewIP network connection, and establishing the first video channel,wherein the session confirmation includes the identification of thefirst dialing device.
 10. The one or more computer readable storagemedia according to claim 9, wherein performing the channel combinationon the first video channel and the second video channels comprises:separately obtaining video call data of the first dialing device andvideo call data of the first IP terminal device and the second IPterminal device based on the first video channel of the first dialingdevice and the second video channels of the first IP terminal device andthe second IP terminal device; generating real-time combined call dataaccording to the video call data of the first dialing device, the firstIP terminal device and the second IP terminal device; and separatelysending the real-time combined call data to the first dialing device,the first IP terminal device and the second IP terminal device based onthe first video channel of the first dialing device and the second videochannels of the first IP terminal device and the second IP terminaldevice.
 11. The one or more computer readable storage media according toclaim 9, wherein performing the channel combination on the first videochannel and the second video channels comprises: separately obtainingvideo call data of the first dialing device and video call data of thefirst IP terminal device and the second IP terminal device based on thefirst video channel of the first dialing device and the second videochannels of the first IP terminal device and the second IP terminaldevice; and recording the video call data of the first dialing device,the first IP terminal device and the second IP terminal device in realtime as a target video file, and locally storing the target video file.12. The one or more computer readable storage media according to claim8, wherein determining the second video channels with the first IPterminal device and the second IP terminal device in response to thefirst video call comprises: querying a current service status of eachagent terminal device in the IP network in response to the first videocall, wherein the current service status indicates being currentlyavailable or currently occupied; and determining the first IP terminaldevice and the second IP terminal device from among agent terminaldevices whose current service status indicates being currentlyavailable.
 13. The one or more computer readable storage media accordingto claim 8, the acts further comprising: obtaining a user inquiry of thefirst dialing device through the first video channel; identifying theuser inquiry, and determining a customer service response correspondingto the user inquiry; returning the customer service response to thefirst dialing device through the first video channel.
 14. The one ormore computer readable storage media according to claim 8, the actsfurther comprising: determining a third video channel with a seconddialing device in response to a second video call, wherein the seconddialing device and the telephone network node are located within thetelephone network; and performing the channel combination on the firstvideo channel, the second video channels and the third video channel toadd the second dialing device to the multi-party video call.
 15. A videoserver comprising: one or more processors; memory storing executableinstructions that, when executed by the one or more processors, causethe one or more processors to perform acts comprising: receiving a firstvideo call sent by a telephone network node based on an InternetProtocol (IP) network connection with the video server, the first videocall including an identification of a first dialing device; determininga first video channel with the first dialing device, and second videochannels with a first IP terminal device and a second IP terminal devicein response to the first video call, wherein the first dialing deviceand the telephone network node are located in a telephone network, andthe first IP terminal device, the second IP terminal device and thevideo server are located in an IP network, wherein determining thesecond video channels with the first IP terminal device and the secondIP terminal device in response to the first video call comprises:querying a current service status of each agent terminal device in theIP network in response to the first video call, wherein the currentservice status indicates being currently available or currentlyoccupied; and determining the first IP terminal device and the second IPterminal device from among agent terminal devices whose current servicestatus indicates being currently available; and performing channelcombination on the first video channel and the second video channels toestablish a multi-party video call between the first dialing device andmultiple IP terminal devices.
 16. The video server according to claim15, wherein determining the first video channel with the first dialingdevice in response to the first video call comprises: sending a sessionrequest to the telephone network node based on the IP network connectionin response to the first video call, where the session request includingan identification of the video server; and receiving a sessionconfirmation sent by the telephone network node based on the IP networkconnection, and establishing the first video channel, wherein thesession confirmation includes the identification of the first dialingdevice.
 17. The video server according to claim 16, wherein performingthe channel combination on the first video channel and the second videochannels comprises: separately obtaining video call data of the firstdialing device and video call data of the first IP terminal device andthe second IP terminal device based on the first video channel of thefirst dialing device and the second video channels of the first IPterminal device and the second IP terminal device; generating real-timecombined call data according to the video call data of the first dialingdevice, the first IP terminal device and the second IP terminal device;and separately sending the real-time combined call data to the firstdialing device, the first IP terminal device and the second IP terminaldevice based on the first video channel of the first dialing device andthe second video channels of the first IP terminal device and the secondIP terminal device.